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ABSTRACT 

Toscana virus is an emerging bunyavirus in Mediter- 
ranean Europe where it accounts for 80% of pediatric 
meningitis cases during the summer. The negative- 
strand ribonucleic acid (RNA) genome of the virus 
is wrapped around the virally encoded nucleopro- 
tein N to form the ribonucleoprotein complex (RNP). 
We determined crystal structures of hexameric N 
alone (apo) and in complex with a nonameric single- 
stranded RNA. RNA is sequestered in a sequence- 
independent fashion in a deep groove inside the hex- 
amer. At the junction between two adjacent copies of 
Ns, RNA binding induced an inter-subunit rotation, 
which opened the RNA-binding tunnel and created 
a new assembly interface at the outside of the hex- 
amer. Based on these findings, we suggest a struc- 
tural model for how binding of RNA to N promotes the 
formation of helical RNPs, which are a characteristic 
hallmark of many negative-strand RNA viruses. 

INTRODUCTION 

Toscana virus is an enveloped negative-strand ribonucleic 
acid (RNA) virus that belongs to the genus phlebovirus 
in the Bunyaviridae family. Members of this genus also in- 
clude Rift Valley Fever virus (RVFV), Sandfly Fever Si- 
cilian virus, Severe Fever with Thrombocytopenia Syn- 
drome virus and Uukuniemi virus. These viruses are etio- 
logical agents of a wide range of illnesses ranging from self- 
limiting febrile episodes to hemorrhagic fevers character- 
ized by multi-organ failure and death (1-4). The sporadic 
outbreaks of these viruses, coupled with their widespread 
distribution across all continents, make them of great im- 



portance to public health. The genome of the virus is en- 
coded in three segments of varying sizes known as the large 
(L), medium (M) and small (S) segments that encode the 
viral polymerase, the envelope glycoprotein and the nucleo- 
protein (N), respectively (3,5). The S segment also encodes 
a virulence determinant known as non-structural small pro- 
tein NSs in the sense direction. 

The genome of phleboviruses, like other bunyaviruses, 
is tightly packed with N into ribonucleoprotein complexes 
(RNPs) (6). The RNP complex, rather than the free RNA, 
represents the transcription and replication template for 
the viral RNA-dependent RNA polymerase (RdRp). The 
RdRp docks on the RNP to synthesize viral messenger 
RNA (mRNA) and complementary antigenomic RNA (6). 

Bunyaviruses replicate in the cytoplasm of host cells. For 
the initiation of transcription, RdRp employs an endonu- 
clease activity to cleave and 'snatch' the caps of host cell 
mRNA to prime viral transcription. This mechanism has 
been demonstrated for La Crosse virus and, by sequence 
comparison, inferred for all bunyaviruses (7-8). 

Unlike other segmented viruses, members of the Bun- 
yaviridae lack a matrix protein that connects the lipid 
envelope-associated glycoproteins to the RNP. Instead, in 
Uukuniemi virus, N interacts with the cytoplasmic tails of 
its cognate glycoproteins. This suggests that the bunyavirus 
N may, in part, fulfill the role of the missing matrix pro- 
tein (9-10). The molecular architecture and RNA-binding 
mechanism of several negative-strand RNA viruses have 
recently been characterized. The N proteins of the Rhah- 
doviridae and Paramyxoviridae crystallized in rings, with the 
RNA bound in a tunnel formed by two lobes of the same 
monomer (6,11-13). The Lassa virus nucleoprotein has 
an exonuclease activity (14-15) that is specific for double- 
stranded RNA (14,16), further indicating that nucleopro- 
teins can have a wider role beyond acting as scaffolds for 
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the RdRp (14-15). Structures of orthobunyaviruses N, e.g. 
of Bunyamwera, Schmallenberg, Leanyer and La Crosse 
viruses (17-21), have revealed unique structural folds. In 
these cases, the RNA is bound in a positively charged cleft 
that is formed by two a-helical lobes. Oligomerization is me- 
diated by both N- and C-terminal arms. 

Recent studies have also provided insights into the mech- 
anism of phlebovirus RNP assembly (22-24). The first 
structure of the RNA-free RVFV N showed a dimer, where 
an amino-terminal arm is folded back into the predicted 
RNA-binding pocket (24). This dimer is thought to repre- 
sent an autoinhibited form of N. A second RNA-free struc- 
ture of RVFV N showed a hexamer in the crystals (22). 
Crystal structures of RNA-bound multimers of RVFV N 
ranging from tetramers to hexamers showed how a polyuri- 
dine RNA sequence is nonspecifically bound between two 
helical lobes (23). 

In this study, we have elucidated structures of the hexam- 
eric RNA-free (apo) N and of hexameric N in complex with 
a single-stranded nonameric RNA. Our structural studies, 
combined with functional experiments, suggest a molecular 
mechanism of how RN A binding to N induces the transi- 
tion from a hexameric ring to a helical RNP assembly. 

MATERIALS AND METHODS 

Plasmids 

A codon-optimized complementary deoxyribonucleic acid 
(cDNA) of the Toscana virus N gene (GenBank acces- 
sion code X53794) was commercially synthesized (Gen- 
script) and supplied in the pUC57 vector. For bacte- 
rial expression as amino-terminal His6-fusion, the gene 
was cloned into pET15b or pET46-ek/LIC vectors (No- 
vagen). RVFV N was expressed from the pET21 vec- 
tor (Novagen). Plasmids for intracellular reconstitution of 
RVFV RNPs, pI.18-RVFV_L, pI.18-RVFV_N, pI.18-HA- 
PKRAE7 and pHH21-RVFV-vMRen were described pre- 
viously (25). The pI.18-derived plasmids expressing RVFV 
N mutants Y30A, K74A, Y30A/K74A, Q198A, R185D, 
R185D/D167A were constructed by site-specific mutagen- 
esis. The firefly luciferase control plasmid pGL3 was pur- 
chased from Promega. 

Expression and purification 

The recombinant plasmids were transformed into BL21 
DE3 or BL21 DE3 Rosetta cells. Bacteria were cultured in 
Luria Broth (LB) medium at 37°C until an OD 600 of 0.4, 
followed by a temperature shift to 20° C. The protein was 
produced for 18 h in the presence of 200-|xM isopropyl (3- 
D-l-thiogalactopyranoside. Bacteria were collected by cen- 
trifugation and resuspended in lysis buffer containing 20- 
mM TRIS (pH 8), 10-mM imidazole, 300-mM NaCl and 
complete ethylenediaminetetraacetic acid-free protease in- 
hibitors, followed by cell disruption in a microfluidizer (Mi- 
crofluidics). Lysates were cleared by centrifugation at 40 000 
g for 30 min at 4°C. 

The supernatant was loaded on a 30-ml gravity flow col- 
umn filled with 3-ml NiNTA resin pre -equilibrated with ly- 
sis buffer. The column was washed with 20-ml lysis buffer, 
followed by 2 5 -ml washing steps with equilibration buffer 



(20-mM TRIS, pH 8.0, 500-mM NaCl) containing increas- 
ing concentrations of imidazole (10 mM, 25 mM and 75 
mM). The protein was eluted from the column using 12.5- 
ml equilibration buffer containing 250-mM imidazole. N 
was further purified via size exclusion chromatography on a 
Sephadex 200 10/300 column (GE Healthcare) equilibrated 
with TRIS-buffered saline (pH 7.5). 

To remove heterogeneously bound RNA from Es- 
cherichia coli, N bound to the NiNTA column was dena- 
tured by incorporating 8-M urea in the washing buffer. 
Bound RNA was washed off using high salt buffer (20- 
mM TRIS, pH 7.5, 500-mM NaCl) and N eluted by 20-mM 
TRIS, pH 7.5, 250-mM imidazole and 500-mM NaCl. 

The fraction of bound RNA was determined by the 
A260/A280 ratio before and after refolding. The refolded N 
had an A260/A280 ratio of 0.6 while the native RNP complex 
purified directly from E. coli had a ratio of ~ 1.5 indicating 
about 10% bound nucleic acid. 

The theoretical isoelectric point of N is 9.2 (26). Cation 
exchange chromatography was used to further purify the 
refolded N and remove partially folded intermediates. To 
this end, the refolded protein was dialyzed against 20-mM 
TRIS, pH 7.5, 50-mM NaCl and bound to a MonoS col- 
umn (GE Healthcare). Protein was eluted using a salt gra- 
dient from 50-mM to 1000-mM NaCl. 

Circular dichroism 

To verify the integrity of the refolded N, circular dichro- 
ism (CD) measurements were performed at room tempera- 
ture using a protein concentration of 1 mg/ml in phosphate 
buffered saline, pH 7, in an Aviv 62 A spectropolarimeter. 

Reconstitution of the RNP complex 

For reconstitution of a homogenous RNP, the refolded N 
was incubated at room temperature for 1 h with commer- 
cially synthesized nonameric RNA (UGUGUUUCU, IDT 
DNA Technologies Inc.), at a molar ratio of 1:1. A final 
gel filtration step was added to remove unbound RNA. The 
stability of the reconstituted complex was monitored by gel 
filtration and also on a native gel. The A 2 6o /A280 ratio of the 
reconstituted RNP complex was 1.1, which corresponds to 
approximately 5% bound RNA. 

Electron microscopy 

N or the indicated N mutants at a final concentration of 
1.5 mg/ml were mixed with an in v^ro-transcribed 8-kb 
single-stranded RNA comprising a Green Fluorescent Pro- 
tein (GFP) expression construct (27) at a final concentration 
of 0.15 mg/ml. The sample was 10-fold diluted with 20-mM 
TRIS, pH7.5, 1 50-mM NaCl, and 15 |xlof this solution was 
spotted onto carbon-coated grids. The sample was stained 
with 2% uranyl acetate and images were acquired on a Zeiss 
EM910 at magnifications between 12.5 and 31.5 k. 

Crystallization and data collection 

Initial crystallization trials were carried out in 96-well for- 
mats using a Phoenix (Art Robbins Instruments) and Topaz 
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system (Fluidigm Corp, South San Francisco, CA, USA). 
Various commercially available screens, including PACT, 
PEG ion, Crystal (Qiagen) and Index Screen (Hampton Re- 
search) were used. Initial hits were translated to the hanging 
drop technique using a drop size of 1 (xl + 1 |xl at a pro- 
tein concentration of 18-20 mg/ml. N crystals were grown 
in 6.5% PEG 3350, 0.1-M sodium malonate, pH 4.5-5.2, 
while RNP crystals were grown in 11% PEG 3350, 0.1-M 
sodium malonate, pH 4.5-5.2. Diffraction data of single 
crystals were collected at BESSY-II BL14.1 and processed 
using XDS and MOSFLM software (28-29). 

The structure of Toscana virus N was solved by molecular 
replacement using a homology model of the RVFV N (PDB 
code 30UO) calculated by the SWISS-MODEL server as a 
search model (30). Both Phaser and MOLREP programs 
were used (31-32). The phase problem for the RNP was 
solved by molecular replacement with the refined Toscana 
virus N as the search model using Phaser. 

Both models were built with COOT (33) alternating with 
refinement in PHENIX (34). For the RNP, positive density 
corresponding to the bound RNA was clearly visible after 
the first round of refinement. For both models, we employed 
dihedral restraints based on the high-resolution structure of 
N from RVFV. Strict non-crystallographic symmetry (NCS) 
was used at the beginning of the refinement and then grad- 
ually released. Translation, Libration and Screw (TLS) was 
used in the final rounds of refinement. Molprobity was used 
for model validation (35). Figures were prepared using Py- 
MOL (The PyMOL Molecular Graphics System, Version 
1.3, Schrodinger, LLC) and the pymol plugin VASco (36). 

Isothermal titration calorimetry (ITC) 

ITC experiments were carried out at 8°C in a VP-ITC or 
iTC200 (Microcal) using degassed 20-mM TRIS, pH 7.5, 
150-mM NaCl. N was diluted and RNA resuspended in 
this buffer prior to the measurement. A 400- u,M solution 
of single-stranded 6-9-mer RNAs was titrated into a solu- 
tion containing 30-(jlM apo N. Binding isotherms were fit- 
ted and equilibrium dissociation constants were calculated 
using the Microcal ORIGIN software. 

Deuterium exchange mass spectrometry 

To obtain optimal fragmentation sequence coverage of 
Toscana virus N, optimal quench conditions were estab- 
lished as previously described (37). Deuterium exchange 
experiments were carried out by mixing 2 julI of Toscana 
virus N stock solution with 6-jjlI D 2 0 buffer (8.3-mM TRIS, 
pH 7.2, 150-mM NaCl in D 2 0) and incubation at 0°C 
for 10 s and 1000 s. At the indicated time points, the ex- 
change reaction was quenched by adding 12 julI of ice-cold 
optimal quench solution [0.05-M guanidinium hydrochlo- 
ride, 15-mM TRIS (2-carboxyethyl)phosphine, 16.6% glyc- 
erol, 0.8% formic acid (FA)], immediately frozen on dry ice 
and stored at -80° C. The non-deuterated control samples 
and equilibrium-deuterated control samples were also pre- 
pared by mixing protein with H 2 0 buffer (8.3-mM TRIS, 
pH 7.2, 150-mM NaCl in D 2 0) and equilibrium-deuterated 
buffer (0.8% FA in 99.9% D 2 0). The frozen samples were 
then thawed at 5°C and passed over an immobilized pepsin 



column (16-|jl1 bed volume) at a flow rate of 20 |jil/min, 
and the resulting peptides were collected on a C18 trap 
for desalting and separated by a Magic C18AQ column 
(Michrom BioResources Inc.) using a linear gradient of ace- 
tonitrile from 6.4% to 38.4% over 30 min. Mass spectrome- 
try (MS) analysis was performed using Liquid Chromatog- 
raphy Quadrupole (LCQ) Classic mass spectrometer from 
Thermo Finnigan, with a capillary temperature of 200°C. 
Data were acquired in both data-dependent MS/MS mode 
and MSI profile mode, and the data analyzed by SEQUEST 
(Thermo Finnigan Inc.) and DXMS explorer (Sierra Ana- 
lytics Inc., Modesto, CA, USA). 

Minireplicon assay 

Assays to measure activity of reconstituted RVFV RNPs 
were essentially performed as described (38). Briefly, sub- 
confluent monolayers of 293 cells seeded in 12-well dishes 
were transfected with pI.18-RVFV_L, pI.18-RVFV_N (or 
mutants thereof), pI.18-HA-PKRAE7 and pHH21-RVFV- 
vMRen (0.5 |jig each), and the firefly luciferase control 
pGL3 (0.05 |JLg), using Nanofectin (PAA). At 24-h post- 
transfection, cell extracts were tested for Renilla and fire- 
fly luciferase activities using the Dual luciferase method 
(Promega). 

RESULTS 

Structure and oligomerization of Toscana virus N 

N of Toscana virus was expressed in E. coli and purified 
via affinity chromatography. The resulting protein was as- 
sociated with E. coli nucleic acids, as apparent by the ob- 
served A 260 /A 2 go ratio of 1.5. This preparation yielded pro- 
tein crystals that showed only weak, anisotropic diffraction. 

Using an unfolding/refolding protocol, nucleic acid-free 
N was obtained, as judged from the A 2 6o/A 2 go ratio of 0.6 
(Supplementary Figure SI A). Correct folding of the protein 
was confirmed by CD spectroscopy (Supplementary Figure 
SIB). Analytical gel filtration experiments combined with 
static light scattering indicated a tendency of the protein 
to self-assemble even in the absence of RNA (Figure 1A), 
unlike the N of the closely related RVFV, which exists as a 
dimer in solution (24). 

It was previously reported that Ns of bunyaviruses bind 
with high affinity to the highly conserved 5' and 3' se- 
quences of their genomic RNA segments (39). In agreement 
with this, N of Toscana virus bound to a nonameric single- 
stranded RNA sequence corresponding to the 3' genomic 
RNA consensus with an affinity of 560 nM in ITC exper- 
iments (Supplementary Figure S2A and Table 1). Only a 
weak binding signal was obtained for a 9-mer RNA with the 
5' genomic RNA consensus sequence (Supplementary Fig- 
ure S2B). Interestingly, Toscana virus N bound with highest 
affinity (K^ = 260 nM) to a control 9-mer RNA correspond- 
ing to the reverted 3' consensus sequence (termed oligol; 
Figure IB). In these titrations, a binding stoichiometry of 
0.5 was derived, indicating that only every second N bound 
an RNA molecule. The observed RNA-binding affinities 
were approximately 10-fold lower compared to those re- 
ported for RVFV Ns (23). This difference is likely related to 
the lower salt concentration and the longer RNA oligomers 
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Figure 1. Structure and oligomerization of the Toscana virus N protein. (A) Gel nitration coupled to right angle light scattering (RALS) analysis of the 
Toscana virus N. Ultraviolet (UV) absorbance is shown in red and refers to the left y-axis, while the apparent molecular mass (in multiples of the molecular 
weight of the monomer) is shown in green and refers to the right y-axis. (B) ITC experiments were carried out in 20-mM TRIS, pH 7.5, 150-mM NaCl at 
8°C to determine equilibrium dissociation constants of the nonameric single-stranded RNA oligol (UGUGUUUCU; Table 1) to the hexameric N (K4 = 
260 nM ± 20 nM, n = 0.49 ± 0.01). Errors represent standard deviations derived from the fittings. (C) Electron micrographs of negatively stained, 8000-bp 
single-stranded RNA, Toscana virus N and in vitro reconstituted Toscana virus RNP particles, respectively. (D) Structure of the Toscana virus N, which 
is comprised of a two-lobed core domain and an amino-terminal arm comprising al and a2. These helices are linked to the core domain by hinge I. (E) 
Ribbon representation of the hexameric Toscana virus N showing that the N-terminal arm mediates oligomerization by wrapping around the neighboring 
subunit. The six-fold symmetry axis of the N hexamer is indicated. (F) Hydrophobic surface representation in which hydrophobic residues are shown in 
green and polar residues in yellow. The amino-terminal arm of the neighboring subunit is shown in stick representation. Ile23, Trp26 and Phe30 are inserted 
in pocket I while Leull, Phel3 and Leul4 bind into pocket II. 



used in the previous study since these authors reported sim- 
ilar RNA-binding affinities for Toscana and RVFV N in 
their experimental setup. Furthermore, RVFV N bound to 
oligol with an almost identical affinity (Kn — 420 nM) com- 
pared to Toscana virus N in ITC experiments (Supplemen- 
tary Figure S2H). 

To further analyze the sequence-specificity of N, base pair 
substitutions were introduced in oligol. The exchange of 



guanine 2 (G2) to adenine did not have a significant effect 
on the binding affinity (Supplementary Figure S2C). Fur- 
thermore, the affinity for a 9-mer polyU sequence was only 
slightly reduced compared to the original 9-mer sequence 
(Supplementary Figure S2D), indicating a certain degree of 
plasticity in RNA recognition. 

We then reconstituted Toscana virus RNPs in vitro using 
an 8000-bp single-stranded RNA. By negative-stain elec- 
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Table 1. RNA sequences used in the study and binding parameters to Toscana virus N 



Oligo 


Sequence 


Binding affinity 


Stoichiometry 


Oligol (used for crystallization} 


UGUGUUUCU 


260 nM ± 20 nM 


0.49 ± 0.01 


01igo2 (3' genomic consensus 


UCUUUGUGU 


560 nM ± 70 nM 


0.43 ± 0.01 


sequence) 








01igo3 (5' genomic consensus 


ACACAGAGA 


not determined 


not determined 


sequence) 








OHgo4 (G2A mutant of oligol) 


UAUGUUUCU 


290 nM ± 40 nM 


0.49 ± 0.01 


01igo5 (polyU 9mer) 


UUUUUUUUU 


360 nM ± 40 nM 


0.40 ± 0.01 


01igo6 (8-mer RNA derived from 


UGUGUUUC 


710 nM ± 30 nM 


0.51 ± 0.01 


oligol) 








OHgo7 (7-mer RNA derived from 


UGUGUUU 


2.7 u,M ± 0.2 |xM 


0.52 ±0.01 


oligol) 








OHgo8 (6-mer RNA derived from 


UGUGUU 


8.0 (jiM ± 0.9 (jlM 


0.50 ±0.03 


oligol) 








Y32A + oligol 


UGUGUUUCU 


1.6u,M±0.2 jjlM 


0.46 ± 0.01 


K79A + oligo 1 


UGUGUUUCU 


220 nM ± 30 nM 


0.42 ± 0.01 


Y32A-K79A + oligo 1 


UGUGUUUCU 


6 (jlM ± 2 jjlM 


0.43 ± 0.05 


K204A + oligol 


UGUGUUUCU 


250 nM ± 10 nM 


0.41 ±0.01 


RVFV N +oligo 1 


UGUGUUUCU 


400 nM ± 100 nM 


0.70 ± 0.02 



tron microscopy, we observed filamentous RNP scaffolds 
with a diameter of 10-12.5 nm that may have an underlying 
helical sub-pattern (Figure 1C). The morphology of these 
RNPs was reminiscent of that of orthobunyavirus RNPs 
(18-21,40), but differed from those reported for RVFV that 
showed a more extended conformation (23-24). 

Crystals of N in the absence and presence of the con- 
trol 9-mer RNA were obtained and diffracted to a maxi- 
mal resolution of 3.3 A and 2.6 A, respectively (Table 2). 
The phase problem was solved by molecular replacement. 
The apo structure contained 12 molecules of N in the asym- 
metric unit, which were organized in two hexameric rings. 
The RNA-containing N crystals had 24 N and 12 RNA 
molecules in the asymmetric unit, which were arranged in 
four almost identical hexameric rings (Supplementary Fig- 
ure S3A and B). The use of non-crystallographic symmetry 
improved electron density during refinement and allowed 
the unambiguous modeling of the complete 9-mer RNA 
in the RNP crystals. The apo structure was refined to an 
^work/^free of 23.9%/26.7% and the RNP structure to an 
^work/^ftee of 20.3%/24.0% (Table 2). As in the RVFV N 
structure (22), each N monomer consisted of a core domain 
comprised of two a-helical lobes (the upper and the lower 
lobe), and an amino-terminal a-helical arm which was di- 
vided by a short linker, hinge II, in cd and a2 (Figure ID 
and Supplementary Figure S4). 

In the hexameric rings of the apo structure, an almost 
perfect six-fold symmetry between the six monomers was 
observed (Figure IE). Oligomerization was driven by the 
amino-terminal arm, which bound into a hydrophobic 
groove of the neighboring monomer at the outside of the 
hexamer (pockets I and II in Figure IE and F, and Supple- 
mentary Figure S3). This interaction constituted the ma- 
jor oligomerization contact, as evident by the buried sur- 
face area of 1 530 A 2 (the total buried surface between two 
monomers is 1690 A 2 ). The interaction of helix al with the 
neighboring monomer included hydrogen bonds and hy- 
drophobic contacts, and a salt bridge between Lys83 and 
Glu29 of the opposing monomer (Supplementary Figure 
S5A). A similar oligomerization mode has been reported for 
RVFV RNPs (22-23). It also agrees with a recent structure 



of hexameric RNA-free Toscana virus N which was pub- 
lished during the preparation of this manuscript (23). 

To probe dynamic features of the hexamer, 
hydrogen/deuterium exchange experiments were per- 
formed. These studies showed that in particular residues 
16-24, corresponding to hinge II of the amino-terminal 
arm, underwent rapid hydrogen exchange in the N hexamer 
(Supplementary Figure S5B), pointing to a highly dynamic 
association of this region in solution. 

Sequence-independent RNA binding in Toscana virus N 

Also in the RNA-bound RNP crystals, oligomerization of 
N into hexamers was mediated by the amino-terminal a- 
helical arm, which filled the hydrophobic groove of the core 
domain at the outside of hexamer (Figure 2A). In agree- 
ment with our ITC data, only every second N bound an 
RNA molecule, whereas the central RNA-binding cleft of 
the neighboring monomer was empty. In all 12 RNA-bound 
Ns of the asymmetric unit, the 9-mer RNA occupied a de- 
fined position in a positively charged cleft at the inside of 
the hexameric ring, where the first and the last residue of 
the RNA could clearly be distinguished in the electron den- 
sity (Figure 2A and B and Supplementary Figure S6). This 
argues for a preferred binding mode of the 9-mer RNA to 
the N hexamer. 

The orientation of the RNA is evident from the asym- 
metric distribution of the two guanine (G) bases at posi- 
tions 2 and 4 which could be discriminated from the remain- 
ing pyrimidine bases in the 9-mer RNA (Figure 2B). The 
RNA bases are deeply buried in the interior of the RNA- 
binding groove and not accessible from the outside. Some 
of the bases stacked against each other, for example G4 and 
U5, and CI, U8 and U9 (Figure 2C and D). All bases pro- 
jected toward the protein, with the exception of G4 which 
pointed toward the interior of the hexameric ring. Tyr32 
stacks against the base of G2, while Tyr32 of the neighbor- 
ing subunit stacks against the base of uridine 9 (U9) (Figure 
2C). In this way, each second N sequesters seven ribonu- 
cleotides, while two overlapping nucleotides are bound by 
neighboring subunits. The resulting steric restraints may ex- 
plain why only every second N is occupied by RNA. In ad- 
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Table 2. Data collection and refinement statistics 



Apo nucleoprotein RNP 



Data collection 






Space group 


P6, 12 protein molecules/asymmetric unit 
(ASU) 


PI, 24 protein and 12 RNA molecule/ASU 


Cell dimensions 






a, b, c (A) 


104.5, 104.5, 510.8 


98.8, 127.8, 170.5 


a,P,y(°) 


90, 90, 120 


82.1, 79.7, 74.5 


Wavelength (A) 


0.91841 


0.91841 


Resolution (A) 


100-3.32 (3.41-3. 32) a 


34.4-2.6 (2.61-2.59) 


Number of observed reflections 


193 157 (11 635) 


514 757 (67 736) 


Number of unique reflections 


45 886 (3257) b 


229 780(31 175) 


Rsym (%) b 


12.4(50.8) 


9.2(55.5) 


I/al 


10.7(2.4) 


11.2(2.1) 


Completeness (%)* 


99.1 (94.5) 


94.7(79.7) 


Redundancy 


4.2 (3.6) 


2.2 (2.2) 


ASU content 


2 x (N 6 ) 


4 x (N6-RNA (3x9mer) ) 


Refinement 




Resolution (A) 


34.1-3.32 (3.39-3.32) 


34.4-2.6 (2.63-2.60) 


Number of reflections 


45 879 (2741) 


229 737 (4788) 


^work / ^free (°/») C 


23.9/26.7(34.3/36.3) 


20.3/24.0 (29.3/35.4) 


Number of protein atoms 


22 846 


45 778 


Averaged B-factor protein (A 2 ) 


73.1 


46.8 


Averaged B-factor RNA (A 2 ) 


NA 


44.3 


RMS deviations 






Bond lengths (A) 


0.003 


0.005 


Bond angles (°) 


0.71 


0.847 


Residues in favored region of the 


95.0 


97.1 


Ramachandran plot d (%) 







a Values given in parentheses refer to reflections in the outer resolution shell. 

b i? sym = S I I{k) — </> I / £ I(k), where I{k) and </> represent the scaled intensity values of the individual measurements and the corresponding mean 
values. 

c For calculation of Rf IIX , 5% of all reflections were omitted from refinement. 
d As analyzed by Molprobity (35). 



dition to Tyr32, other residues contacted the RNA bases. 
For example, Phe35 stacks against U3, and Ser211, SerllO 
and Argl91 interacted via side-chain interactions with the 
bases of Ul, U6 and C7, respectively. Thr201 interacts with 
the base of G4 via a main-chain interaction (Figure 2C). 

The phosphate backbone of the 9-mer RNA is recog- 
nized by several positively charged and polar residues. For 
example, Argl 1 1 and Lys72 formed salt bridges to the phos- 
phates of G4 and U6, respectively. Interestingly, Lys204 and 
Lys79 formed hydrogen bonds to the phosphate backbone 
at the 5' end of the RN A whereas they do not interact with 
the phosphate backbone at the 3' RNA end (Figure 2C). 

Assuming an identical conformation of the phosphate 
backbone, we modeled the nonameric sequence of the 5' 
non-coding region (oligo3 in Table 1) in the RNA-binding 
groove (Supplementary Figure S7). Due to the higher abun- 
dance of purine bases in oligo3, steric clashes were pre- 
dicted. This may explain the reduced affinity of this se- 
quence to N. 

During the preparation of this manuscript, tetra-, penta- 
and hexameric structures of the RVFV N bound to 28-36- 
mer polyU RNA sequence were reported (23). A compar- 
ison with our RNP structure indicates that the gross fea- 
tures of RNA recognition are conserved between Toscana 
virus and RVFV. Tyr32 has a direct counterpart in RVFV 
N (Tyr30), which similarly stacks with the guanine and uri- 
dine bases, respectively. Also the phosphate backbone in- 



teracts in a similar fashion with the protein in both struc- 
tures; for example Lys72 and Argl 11 in Toscana virus N 
are conserved in both viruses and form similar contacts to 
the phosphates. However, species-dependent differences ex- 
ist. In RVFV N, Arg70 contacts the phosphate backbone 
whereas the corresponding alanine in Toscana virus N is 
not involved in RNA binding. Vice versa, Lys204 at the 
5' RNA end in our RNP forms a salt bridge to the phos- 
phate of RNA residue 1 whereas the corresponding glu- 
tamine residue in the RVFV RNP structure does not partic- 
ipate in RNA binding (see also below). Arg99 in the RVFV 
N complex makes a prominent contact with Tyr32 and ad- 
ditionally contacts the phosphate backbone. Somewhat sur- 
prisingly, the corresponding residue in Toscana virus N, 
Argl 04, is not involved in RNA binding in our model. Inter- 
estingly, G4 in the Toscana virus N structure is rotated com- 
pared to the corresponding uridine in RVFV N and faces 
the solvent channel, which might represent a mechanism 
to allow base plasticity at this position. Our results sug- 
gest that the general mode of RNP assembly is conserved in 
phleboviruses, while minor species-specific differences may 
exist. 

The RNA-binding tunnel 

The availability of both apo and RNA-bound N structures 
allows for the characterization of conformational changes 
associated with RNA binding. In the apo- and the RNA- 
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Figure 2. RNA binding to the N liexamer. (A) Surface representation of the Toscana virus RNP complex showing three RNA molecules in red bound to 
the inside of the hexameric ring. Binding of RNA induces a change from a six-fold to a three-fold symmetry (triangle). (B) 2F 0 -F C map contoured at 2<r 
showing electron density for the bound RNA oligomer. The asymmetric sequence allowed unambiguous fitting of the nonameric RNA into the electron 
density. (C) Cartoon representation showing selected contacts of N with the RNA. The RNA is capped on either side by Tyr32 of adjacent monomers. 
Residues depicted in white belong to the adjacent subunits (see Figure 2A for chain numbering). (D) A schematic diagram of the orientation of the RNA 
bases in the RNA-binding groove. RNA bases are recognized by both main and side-chain contacts. 



bound forms, the core domains of N excluding the amino- 
terminal arm superimpose with a root-mean-square devia- 
tion of 0.4 A indicating that RNA binding induces only mi- 
nor conformational changes within the a-helical core. How- 
ever, the amino-terminal arm of the RNA-bound subunits 
is considerably displaced against the N core domain when 
compared to the apo structure or the RNA-free Ns in the 
RNP (Figure 3A and Supplementary Figure S6). In the apo 
N structure, Tyr32 from hinge I, Lys204 from the lower 
lobe and Lys79 from the upper lobe of the adjacent sub- 
unit are in close proximity to each other and cap the RNA- 
binding tunnel (Figure 3B). Due to their direct RNA in- 
teraction (Figure 3A), Tyr32 and Lys79 become separated 
from Lys204 by approximately 15 A (Figure 3C). Interest- 
ingly, this extensive separation is only observed at the inter- 
subunit junction at the 5', but not at the 3' RNA end, where 
Lys79 and Lys204 do not reach the phosphate backbone 
(Figure 2C and Supplementary Figure S8). 

Mutagenesis was performed in order to assess the impor- 
tance of Tyr32, Lys79 and Lys204 for RNA binding. The 
Y32A mutant bound the 9-mer RNA with six-fold lower 



affinity (K d =1.6 (jlM) compared to wild-type N (Figure 
3D). In contrast, the K79A and K204A mutation did not 
significantly affect the RNA-binding affinity (Figure 3D). 
Interestingly, the simultaneous mutation of both Tyr32 and 
Lys79 significantly diminished the RNA-binding affinity 
(Kd — 6 |jlM), more than the single Y32A mutation, sug- 
gesting that these two adjacent residues cooperate in RNA 
binding. All three single mutants were able to form RNPs 
in vitro using a synthetic single-stranded RNA as a tem- 
plate, although RNPs formed by the K204A mutant ap- 
peared somewhat shorter compared to wild type (Figure 
3E). The Y32A/K79A double mutant assembled into struc- 
tures more reminiscent of aggregates than functional fila- 
ments, suggesting some problems in proper RNP assembly. 

The importance of these residues for viral RNP activ- 
ity was assessed using a system for intracellular reconsti- 
tution of viral RNPs from transfected cDNA plasmids, a 
so-called minireplicon assay. Since this assay has not been 
established for Toscana virus, we initially tested a heterol- 
ogous approach using the RVF V RdRp and N of Toscana 
virus to drive the replication of a reporter gene (41). How- 
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Figure 3. Conformational changes in the RNA-binding tunnel. (A) Superposition of an apo (brown) and RNA-bound (blue) N monomer (bound RNA 
is not shown). In these two structures, the amino-terminal arm is displaced relative to the core domain. (B) Cartoon representation showing the Tyr32- 
Lys79-Lys204 triad, which is situated in the RNA tunnel in the apo structure. (C) Stacking of Tyr32 against the guanine base of G2 and binding of Lys79 
and Lys204 to the phosphate backbone of the RNA creates an open RNA-binding tunnel, as shown in the RNP structure. (D) ITC-binding curve of 
the 9-mer RNA oligol to the following N mutants: Y32A (Ad = 1.6 jjlM ± 0.2 u.M, n = 0.46 ± 0.01); K79A (A d = 220 nM ± 30 nM, n = 0.42 ± 0.01); 
Y32A-K79A (A d = 6 |xM ± 2 |xM, n = 0.43 ± 0.05); K204A (A d = 250 nM ± 10 nM, n = 0.41 ± 0.01). (E) EM microscopy of reconstituted RNP particles 
of the Toscana virus Y32A, K79A, Y32A-K79A and K204A mutants, as described in Figure 1C. (F) Minireplicon assay of RVFV with the indicated N 
mutants. Cells were transfected with constructs for RVFV L, a model minigenome expressing a reporter gene, RVFV N, and the indicated N mutants. The 
reporter values of the minireplicon system reflect the activity of the recombinant RNPs reconstituted from expressed cDNA sequences. 
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ever, we were unable to detect any reporter activity in this 
hybrid system (data not shown). 

We then switched to the well-established RVFV minirepli- 
con system and introduced mutations to equivalent residues 
within the RVFV N (25). While Tyr32 and Lys79 are highly 
conserved in phleboviruses (corresponding to Tyr30 and 
Lys74 in RVFV; Supplementary Figure S4), Lys204 in 
Toscana virus N is replaced by Gin 198 in RVFV, which sim- 
ilarly could contact the phosphate backbone. Surprisingly, 
mutating the invariant Tyr30 or Lys74 to alanine did not 
significantly hamper reporter activity (Figure 3F). How- 
ever, simultaneous mutation of both conserved residues re- 
sulted in a stable N, which was nonetheless inactive. This 
suggests that Tyr30 and Lys74 work in concert with me- 
diate viral replication. The Q198A mutant exhibited a re- 
duced activity relative to wild type, possibly pointing to an 
important function in viral replication (Figure 3F). How- 
ever, western blot analyses showed that Q198A was largely 
undetectable, indicating a general instability. 

Coupling of RNA binding to an inter-subunit rearrangement 

The four hexameric rings in the asymmetric unit of the 
RNA-bound N hexamers were stacked in a bottom-to- 
bottom assembly, e.g. in the opposite orientation as in the 
apo structure (Supplementary Figure S3). Every second N 
monomer was laterally displaced relative to the neighboring 
N leading to a staggered assembly of the hexameric ring. 
In this way, the six-fold symmetry in the N hexamer was 
broken, resulting in three -fold symmetry of the RNP (Fig- 
ure 4A and B). Considerable differences were apparent in 
the inter-subunit assembly of N at the 5' and 3' ends of the 
9-mer RNA molecules. In the following, we refer to the N 
interplay at the 5' RNA end, which appears to represent a 
functional assembly akin to helical RNP structures (see be- 
low). 

Analysis of the inter-subunit assembly revealed that the 
position of Tyr32, and in fact the whole N-terminal arm, 
relative to the associating N was almost identical in the apo- 
and RNA-bound forms (Figure 4C). However, the simulta- 
neous binding of Lys204 and Lys79 of the adjacent subunit 
to the phosphate backbone of the RNA led to a 45° rotation 
between the subunits, with Tyr32 working as a pivot point 
(Figure 4C and D). This domain movement created a new 
inter-subunit interface of 480 A 2 at the outside of the hex- 
amer, which did not involve the amino-terminal arm (Fig- 
ure 4E and F). It included several hydrogen bonds and a 
salt bridge between Asp 173 and Argl91 from two oppos- 
ing subunits, which moved 17 A toward each other. At the 
inter-subunit junction at the 3' RNA end, Lys204 and Lys79 
did not bind RNA and occupied a similar position as in the 
apo form relative to the neighboring monomer (Supplemen- 
tary Figure S8). Also in the RNA- and DNA-bound ring- 
like RNP structures of RVFV, Glnl98 does not bind to nu- 
cleic acids. Consequently, no marked inter-subunit rotation 
is observed in these structures and no additional interface 
is formed (Supplementary Figure S9). 

To clarify the importance of this new interface, we in- 
troduced mutations and analyzed the consequence for viral 
transcription and RNP production (Figure 4G). The sin- 
gle point mutant R185D was expressed similarly to wild 



type, but exhibited greatly reduced activity in the reporter 
gene assay. The double mutant, D167A/R185D, was un- 
stable and, accordingly, also did not show reporter activity. 
These data are consistent with an important role of this in- 
terface for RNP activity. 

To further characterize this domain rearrangement, bind- 
ing of shorter RNA oligomers to N was evaluated. As ex- 
pected, consecutive shortening of the RNA gradually re- 
duced the affinity for N (Supplementary Figure S2E, F and 
G). Surprisingly, the binding stoichiometries of the shorter 
RNA oligomers (6-8 mers) did not significantly differ from 
the 9-mer RNAs (n — 0.4-0.5). Since each N binds seven 
RNA bases, the 6-mer RNA would have enough space to 
bind to adjacent Ns in the hexameric ring. Our data sug- 
gest that RNA binding and the associated structural rear- 
rangements of adjacent subunits prevent binding of a sec- 
ond RNA molecule in the context of a hexameric ring. 

Model for the helical RNPs 

Our EM analysis suggested that Toscana virus N forms 10- 
nm RNP fibers upon interaction with longer RNAs (Fig- 
ure 1C). To obtain structural insights into RNP formation, 
we applied the observed inter-subunit shift observed at the 
5'-RNA end to all consecutive subunits. This transforma- 
tion resulted in a left-handed N helix with an outer diam- 
eter of 10 nm (Figure 5). Each turn of this helix contained 
5-6 N corresponding to the sequestration of 35-42 RNA 
nucleotides. In this model, long RNA sequences could be 
encapsidated in a continuous RNA-binding tunnel at the 
inside of the RNP involving consecutive Ns. 

DISCUSSION 

Phleboviruses are important human pathogens, and study- 
ing their mechanism of RNA encapsidation and replica- 
tion is of vital importance. In the current study, we an- 
alyzed structures of uncomplexed and RNA-bound N of 
Toscana virus. These structures show that the gross features 
of oligomerization and RNA binding in Ns are conserved 
among phleboviruses (22-24). Oligomerization is mediated 
by an amino-terminal arm that binds in a hydrophobic 
pocket of the next monomer. RNA, in turn, is sequestered in 
a positively charged cavity at the inside of the oligomer, in a 
mostly sequence -independent fashion. However, it has been 
unclear how the hexameric planar architectures of the apo 
Ns of RVFV and Toscana virus are compatible with bind- 
ing of longer RNA sequences. Also the RNA- and DNA- 
bound structures of RVFV RNPs show ring-like architec- 
tures, which do not explain how longer nucleotide sequences 
can be bound (23). 

Here, we describe a molecular mechanism how RNA 
binding can be coupled to an inter-subunit movement in the 
hexameric N inducing the transition from a ring-like to a 
helical RNP. Central to this inter-subunit movement is the 
interplay between Tyr32 and Lys204 in one subunit, Lys79 
in the adjacent subunit and their interactions with RNA. 
In the absence of RNA, Tyr32, Lys204 and Lys79 localize 
closely together in the RNA-binding tunnel. At the 5' end of 
our 9-mer RNA, they become separated upon RNA bind- 
ing. Tyr32 and Lys79 cooperate to bind the RNA from the 
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Figure 4. RNA binding induces an inter-subunit shift. (A) Side view of three subunits of the apo N ring showing a planar assembly of the subunits. 
(B) Same view on the RNP, detailing the staggered assembly induced by RNA binding. The two peripheral subunits are displaced relative to the central 
subunit (indicated by arrows). (C) One monomer of the apo structure (brown) was superimposed with a monomer of the RNP structure (white; this subunit 
did not contain 9-mer RNA) at the 5' RNA end. The relative position of the neighboring subunits is depicted (yellow: apo structure; blue: 9-mer RNA- 
bound subunit). A magnification of the box is shown at the right. While Tyr32 of the yellow and the blue monomer occupy a similar position relative to 
the white/brown monomer, Lys204 undergoes an 11-A displacement. (D) Side view of panel (A) showing that RNA binding induces a 45° tilt between 
neighboring subunits. (E) View on the apo structure from the outside of the hexamer. The inter-subunit contact is almost exclusively mediated by the amino 
terminal arm. Arg213 stabilizes the amino-terminal arm by binding to the amide backbone of Asn28 in the same chain. (F) Same view of the Toscana 
virus RNP structure showing that a new inter-subunit interface is formed upon RNA binding. Arg213 forms a hydrogen bond to the peptide backbone 
of Leul31 while Arg217 forms a hydrogen bond to Thrl37. A new salt bridge is formed between Aspl73 and Argl91 of the neighboring subunit. (G) 
Minireplicon assay with the indicated mutants, as in Figure 3F. 
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Figure 5. Model for the helical assembly of Toscana virus RNP. (A) The helical assembly of Toscana virus RNP was modeled by extending the observed 
inter-subunit shift at the 5' RNA end to the adjacent subunits. This transformation results in a left-handed helical filament with an external diameter of 
10 nm. RNA can be bound in a continuous RNA-binding groove at the inside of the helix (indicated in red). (B) A top view of this helix. (C) Schematic 
representation of the proposed steps involved in the formation of the RNP filament. Binding of longer RNA sequences to the N hexamer results in an 
upward shift of the neighboring subunit akin to a helical RNP. The consecutive recruitment of further subunits through both the amino-terminal arm and 
an outer interface formed by the D173-R191 salt bridge may extend this hexamer. 



upper side, whereas Lys204 contacts the phosphate back- 
bone from the lower lobe. In this way, the RNA-binding 
cleft is opened and a continuous RNA-binding tunnel is cre- 
ated. This situation is somewhat reminiscent of Lassa virus 
RNPs where a flexible loop regulates the open and closed 
conformations of the RNA tunnel (13). The conformational 
changes resulting from RNA binding are then translated 
into an inter-subunit rotation creating a new assembly in- 
terface at the outside of the oligomer. Mutations in this 
new interface reduce the transcriptional activity, as appar- 
ent in the minireplicon assay. We propose that the observed 
inter-subunit architecture at our 5' 9-mer RNA represents 
the architecture of every inter-subunit junction in a func- 
tional RNR The almost identical assembly of all 12 inter- 
subunit junctions in four independent RNPs in the asym- 
metric unit argues for a defined state of the inter-subunit 
junction, rather than a random displacement of Ns in the 
crystal lattice. Most importantly, such architecture would 
allow binding of longer RNA sequences in a continuous he- 
lical RNP oligomer. To our knowledge, this is the first ex- 
ample describing the molecular mechanism of how RNA 
binding induces transition from a planar ring to a helical 
structure in an RNP assembly. 

Why is such transition not observed in the crystal struc- 
tures of RVFV RNPs or at the 3' end of the RNA molecule? 
Also in the RVFV RNPs, the three residues in the inter- 



subunit junction (Tyr30, Lys74 and Glnl98) separate upon 
RNA binding. However, Glnl98 does not contact the RNA 
and, consequently, no inter-subunit rotation is observed. 
The RNA and DNA sequences used for the crystallization 
of the RVFV RNP symmetrically fit into one ring at the in- 
side of the RNP. We speculate that, in these cases, a closed 
ring structure might be more stable than an open helix 
turn and therefore be favored for crystallization. Similarly, 
Lys204 and Lys79 do not contact the RNA backbone at the 
3' RNA end of our Toscana virus RNP and, consequently, 
no major rotation is observed. The 9-mer RNA may be too 
short at this side and/or the induction of a similar shift as 
at the 5' RNA end would induce an open ring and there- 
fore be energetically unfavored. Furthermore, it can be en- 
visaged that the proposed inter-subunit shift is energetically 
not favoured over the planar architecture, since we do not 
observe it in the apo N hexamer. Instead, it may be driven 
by RNA binding. This may also explain why the K204A 
and K79A mutants still bind with similar affinity compared 
to wild-type N to our 9-mer RN A — the loss of binding en- 
ergy due to these mutations may be compensated by a re- 
duced inter-subunit shift. In support of such hypothesis, the 
in vitro reconstituted RNPs of the K204A mutant appear 
shorter compared to wild type, and we see reduced activity 
of the corresponding RVFV N proteins K74A/Y30A in the 
minireplicon assays. 
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It has been known for some time that phlebo- and 
tenuiviruses are evolutionarily related (42). Both have mul- 
tipartite genomes with identical 5' termini nucleotides (5'- 
ACACAAAG) and complementary 3' nucleotides. Their 
segmented genomes are both negative and ambisense and 
the encoded proteins show significant sequence and struc- 
tural similarity. Accordingly, the phlebovirus and tenuivirus 
N belong to the same protein family (Pfam 05733). Further- 
more, structural predictions reveal significant homology be- 
tween the Ns of Toscana and Rice hoja blanca and maize 
stripe tenuiviruses (Supplementary Figure SI OA and B) 
(43-44). Interestingly, tenuivirus RNPs can assemble into 
helical and superhelical structures (Supplementary Figure 
S10C) (45). Also the reported diameter of the tenuiviruses 
(8 nm) agrees well with the diameter of our RNP model he- 
lix (10 nm). We therefore envisage that our proposed mech- 
anism for the helical assembly induced by RNA sequestra- 
tion in Toscana virus RNPs is applicable to a wider range 
of viruses. 
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